My history here
- Postdoc with Rafa Irizarry at JHU 2008-2010
- Spoke at symposium 2009 (I think)
- Won "worst poster" award at symposium 2008, still wear that shirt proudly
October 13, 2016
metagenomeSeq, metagenomeFeatures, antiProfiles, minfi, bumphunter, HTShape, qsmooth, Rcplex, Rcsdp
Collaborative and exploratory analysis
Bsmooth, minfi)
epivizr package
Creativity in exploration
We are building software applications to support creative exploratory analysis of large genome-wide datasets…
Statistically-guided exploration: Calculate a statistic of interest
# Get tumor methylation base-pair data
m <- assay(se)[, "tumor"]
# Compute regions with highest variability across CpGs
region_stat <- calcWindowStat(m, step=25, window=80, stat=rowSds)
s <- region_stat[, "stat"]
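The slide does not show how `calcWindowStat` is defined, so the following is only a minimal sketch of what a windowed statistic of this kind could look like. The function name `windowStatSketch`, its defaults, and the toy input vector are all assumptions made for illustration; this is not the talk's implementation. It uses base R only, with `apply(w, 1, sd)` standing in for `rowSds`.

```r
# Hypothetical stand-in for the talk's calcWindowStat(); not the author's code.
# x:      numeric vector of per-position measurements (e.g. methylation)
# step:   distance between consecutive window starts
# window: number of positions in each window
# stat:   function taking a matrix (one row per window) and returning
#         one value per row
windowStatSketch <- function(x, step = 25, window = 80,
                             stat = function(w) apply(w, 1, sd)) {
  x <- as.numeric(x)
  stopifnot(length(x) >= window)
  # Start index of each window along the vector
  starts <- seq(1, length(x) - window + 1, by = step)
  # Build a matrix with one row per window
  windows <- t(vapply(starts,
                      function(s) x[s:(s + window - 1)],
                      numeric(window)))
  cbind(indices = starts, stat = stat(windows))
}

# Toy input (made up): flat signal with an alternating block at 201..300
x <- c(rep(0, 200), rep(c(-1, 1), 50), rep(0, 200))
res <- windowStatSketch(x, step = 25, window = 80)

# Rank windows by the statistic, most variable first
top <- res[order(res[, "stat"], decreasing = TRUE), , drop = FALSE]
head(top, 3)
```

Returning the window start alongside the statistic lets the windows be re-ordered by the statistic and then mapped back to positions, which is the pattern the slides use when indexing `rowRanges(se)`.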
Explore data based on statistic
What's around the regions with the highest variability across CpGs?
# Get locations in decreasing order
o <- order(s, decreasing=TRUE)
indices <- region_stat[o, "indices"]
slideShowRegions <- rowRanges(se)[indices] + 1250000L
mgr$slideshow(slideShowRegions)
Dynamically extensible: easily integrate new data types and add new visualizations.
Visualization design goals
Visualization goals
One interpretation of Big Data: many relevant sources of contextual data
metagenomeSeq, metagenomeFeatures, metaviz
What is the measurement?
Features:
Hierarchically organized features
Defining the measurement unit of analysis
Not just features, but samples may be hierarchically organized
metagenomeSeq, metagenomeFeatures, metavizr)
Acknowledgements
Justin Wagner, Jayaram Kancherla (CBCB)
Florin Chelaru (now at Twinfog), Joseph Paulson (now at Harvard)
Mihai Pop (CBCB)
Feinberg Lab & K. Hansen (JHU), R. Irizarry (Harvard)
HMP2 Project (Xavier and Huttenhower, Harvard)
Funding: NIH, Genentech, Gates Foundation
More information